Index Support for SPARQL

نویسندگان

  • Ralf Heese
  • Ulf Leser
  • Bastian Quilitz
  • Christian Rothe
چکیده

The Resource Description Framework (RDF) is the fundamental data model underlying the Semantic Web. Recently, SPARQL has been proposed as W3C standard for querying RDF repositories. As RDF is a graph-based data model, the core problem of evaluating SPARQL queries is matching query graph patterns against the data graph, which is computationally very expensive. We address this problem by indexing graph patterns. In the spirit of SQL, we assume that users select the patterns to be indexed. We formally define the problem of covering indexes, i.e., finding those indexes whose graph-patterns are contained in the query pattern, and derive formulas for estimating index selectivity. Finally, we study the problem of finding optimal sets of indexes for a given query. We believe that our framework is the first comprehensive suggestion for indexing RDF for SPARQL queries that goes beyond simple indexing of labels.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

SPACE: SPARQL Index for Efficient Autocompletion

Querying Linked Data means to pose queries on various data sources without information about the data and the schema of the data. This demo shows SPACE, a tool to support autocompletion for SPARQL queries. It takes as input SPARQL query logs and builds an index structure for efficient and fast computation of query suggestions. To demonstrate SPACE, we use available query logs from the USEWOD Da...

متن کامل

SAMUEL: A Sharing-based Approach to processing Multiple SPARQL Queries with MapReduce

The volume of RDF data is now growing tremendously. It is thus considered prudent to store and process massive RDF data with distributed SPARQL engines instead of relying on a singlemachine system.Many sophisticated index and partitioning schemes have also been proposed to support SPARQL query evaluations. However, existing SPARQL engines have mainly followed oneat-a-time scheme so that query e...

متن کامل

Using an index of precomputed joins in order to speed up SPARQL processing

SparQL is a query language developed by the W3C, the purpose of which is to query a data set in RDF representing a directed graph. Many free available or commercial products already support SparQL processing. Current index-based optimizations integrated in these products typically construct indices on the subject, predicate and object of an RDF triple, which is a single datum of the RDF data, i...

متن کامل

Towards Efficient SPARQL Query Processing on RDF Data

Efficient support for querying large-scale RDF triples plays an important role in Semantic Web data management. This paper proposes an efficient RDF query engine to evaluate SPARQL queries, where the inverted index structure is employed for indexing RDF triples. We first design and implement a set of operators on the inverted index for query optimization and evaluation. Then we propose a main-t...

متن کامل

Substring Filtering for Low-Cost Linked Data Interfaces

Recently, Triple Pattern Fragments (tpfs) were introduced as a low-cost server-side interface when high numbers of clients need to evaluate sparql queries. Scalability is achieved by moving part of the query execution to the client, at the cost of elevated query times. Since the tpf interface purposely does not support complex constructs such as sparql filters, queries that use them need to be ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006